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Abstract — In environments with increasing uncertainty, such 
as smart grid applications based on renewable energy, planning 
can benefit from incorporating forecasts about the uncertainty 
and from systematically evaluating the utility of the forecast 
information. We consider these issues in a planning framework 
in which forecasts are Interpreted as constraints on the possible 
probability distributions that the uncertain quantity of interest 
may have. The planning goal is to robustly maximize the ex¬ 
pected value of a given utility function, integrated with respect 
to the worst-case distribution consistent with the forecasts. 
Under mild technical assumptions we show that the problem 
can be reformulated into convex optimization. We exploit this 
reformulation to evaluate how informative the forecasts are in 
determining the optimal planning decision, as well as to guide 
how forecasts can be appropriately refined to obtain higher 
utility values. A numerical example of wind energy trading in 
electricity markets illustrates our results. 

I. Introduction 

The intrinsically uncertain nature of renewable energy 
sources, for example their dependence on local weather con¬ 
ditions [1], impedes the reliable operation of the electricity 
grid [2]. Forecasts about the renewable energy generation 
provide a means to cope with this uncertainty. Hence the 
successful incorporation of forecasts into planning grid op¬ 
eration emerges as an important challenge, as well as the 
problem of obtaining forecasts that provide the most valuable 
information for planning. 

Uncertainty is usually considered in a stochastic frame¬ 
work during the grid planning stage. Generation, for exam¬ 
ple, is modeled as a random variable, and planning decisions 
are made a certain length of time before the generation is 
realized at operation time. Planning in this context can pro¬ 
vide probabilistic guarantees, for example, that undesirable 
events will happen with low probability, or that expected 
operation costs/utilities are optimized. Examples can be 
found in, e.g., dispatch problems [3], unit commitment [4], 
and participation in electricity markets [5], [6]. 

Solving such stochastic optimization problems relies on 
the availability of the underlying probability distribution of 
the uncertain quantities. The renewable generation forecasts 
however do not provide complete descriptions of the under¬ 
lying uncertainty, in the sense that they do not completely 
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describe the probability distribution of the generation. Cer¬ 
tain questions arise in this context; 

1) How can forecasts of the quantities of interest be 
incorporated in a stochastic planning framework? 

2) How informative is a set of given forecasts to a specific 
planning problem? 

The first question is crucial to reliably integrate renewable 
generation in the grid. The second question becomes prac¬ 
tically relevant if obtaining forecasts during the planning 
stage incurs significant cost. This could be, for example, the 
computational cost of running complex renewable genera¬ 
tion forecasting algorithms, typically combining numerical 
weather prediction models, historical data, local measure¬ 
ments, and simulations [7]. Alternatively these costs could 
be monetary, as is the case when grid operators and power 
producers do not create their own forecasts but instead 
purchase them in the form of products from companies spe¬ 
cializing in forecasting [8], [9]. Understanding how valuable 
are the forecasts in determining planning decisions could 
help reduce the associated costs of obtaining such forecasts. 

In this paper we take a step towards mathematically 
formalizing and answering the above questions. The simplest 
type of forecast for an uncertain quantity that is modeled 
as random is a point forecast, i.e., a single value thought 
as the most likely or the expected value of the quantity. 
However, more sophisticated types of forecasts, such as 
probabilistic forecasts, have been introduced for the case 
of renewable generation [6], [7]. Unlike point forecasts, 
probabilistic forecasts provide higher fidelity information 
about the probability distribution of the random quantity. 
Prediction intervals, i.e., bounds on the probability that the 
quantity takes values in certain intervals, are a common 
example [7]. 

We consider a general stochastic planning framework 
under probabilistic forecasts (Section [Till. We look for a 
planning decision that maximizes the expected value of a 
given utility function, but the probability distribution of the 
uncertain quantity, with respect to which the expectation is 
computed, is unknown and only described via the forecasts. 
We interpret the probabilistic forecasts as constraints defining 
a set of possible probability distributions for which the 
decision needs to account. Consequently a robust (max-min) 
planning problem is formulated to determine the decision 
that maximizes the worst-case expected utility, where the 
worst-case expectation is selected from the set of probability 
distributions consistent with the forecasts. 

In Section ini] we utilize Lagrange duality theory [10] to 
show that the robust planning problem admits an equivalent 


reformulation to a single maximization problem with the 
introduction of auxiliary (dual) variables. This resulting 
problem is convex under certain relatively mild assumptions 
on the planning utility function, but it includes an infinite 
number of constraints. For the special case of forecasts in 
the form of prediction intervals we show that the constraints 
can be reduced to a finite number (Section rilTAb . resulting in 
a standard (finite-dimensional) convex optimization problem 
that can be solved readily [11]. 

The problem of selecting the worst-case probability distri¬ 
bution given probabilistic descriptions has been discussed 
previously in the context of optimal uncertainty quantifi¬ 
cation [12]. The difference in our formulation however is 
that an additional planning optimization level is consid¬ 
ered. Related planning formulations under uncertainty about 
probability distributions have been considered in, e.g., [13], 
[14] and references therein, where the optimal planning 
is solved in a data-driven fashion based on samples from 
the unknown underlying distribution. Conceptually similar 
approaches have been considered in the context of power 
systems, where expected values in stochastic optimization 
are approximated using samples obtained from wind power 
forecasting algorithms [4], [15]. In contrast, planning in our 
work is decoupled from the data collection and does not 
rely on sample-based approximations, but instead utilizes the 
output of the forecasting procedures, i.e., the probabilistic 
forecasts themselves. 

A further novelty of our approach is that it allows to 
characterize how valuable is the forecast information in 
determining the optimal planning decision. This characteri¬ 
zation follows from a sensitivity analysis of the optimal plan¬ 
ning objective value with respect to the forecast constraints 
(Section HYll. Moreover, under the hypothesis that forecasts 
can be refined by, e.g., a forecasting oracle, we develop a 
sensitivity-driven procedure that sequentially selects forecast 
refinements to yield a higher robust planning utility value. 
Numerical examples of the proposed approach for trading 
wind energy in electricity markets [5], [6] are presented in 
Section [V] Finally, we conclude with a discussion on our 
contributions and future work in Section 

II. Problem description 

We consider a decision-making framework under uncer¬ 
tainty. The unknown quantity of interest, for example the 
unknown renewable power generation, is denoted by x and 
we assume that x takes real values in a subset A” C R 
of the reals. We model a; as a random variable following 
some probability distribution on X. As we will make clear, 
this distribution is not known completely. Hence there is 
uncertainty about the distribution of the random quantity of 
interest. 

Suppose a decision-maker needs to select the value of 
a controlled parameter b before the random quantity x is 
realized, i.e., drawn from its probability distribution. We 
assume b takes values in a convex subset S C M of the 
reals. To rank the different choices for b we assume that a 
function J{x,b) models the utility to the user if decision b 



Bidding b Power generation x 


Fig. 1. Profit J{x^b) from bidding in electricity markets for p = 1, 
q = 1.6. On the left generation is kept constant x = 0.5 while bid b varies. 
On the right bid is kept constant 5 = 0.5 while generation x varies. 


is taken a priori and the unknown quantity takes value x. 
Technically we assume the utility function is continuous in 
both variables x and b, and also concave in b for every value 
of X & X, and concave in x for every value of 5 S S. 

Example 1. Consider the case of a wind power plant 
participating in a day-ahead electricity market [5], [6]. The 
producer provides a capacity bid in the market, which we can 
denote by a decision variable b, representing the estimated 
renewable power generation that will be supplied to the grid 
at a future time interval. We let b take values in the unit 
interval, b G [0,1], which can be interpreted as a normalized 
percentage with respect to the maximum capacity of the 
generator. If we denote by x the actual realized power, also 
normalized in x € [0,1], a simple model [5] of the producer’s 
monetary profit from a realization x after bidding b is 

J{x,b) = pb - q[b - x]+, (1) 

where [0]+ = max{0,O}. The constant p > 0 rewards 
high bids, while the constant q > 0 penalizes a shortfall 
of generation compared to the bid, i.e., when 6 > x. It is 
assumed that q > P, implying that the profit decreases for 
higher shortfalls. The utility function, as can be seen in Fig. 
[H satisfies the convexity assumptions of the general problem 
formulation. 

A common assumption in planning problems under un¬ 
certainty is that the underlying probability distribution of 
the unknown quantity is known. In particular, if the random 
variable x has a probability distribution F, the expected 
utility to the user from a choice b is given by 

'Exr^F Jix,b) = I J{x,b)dF{x), (2) 

J X 

where the integral is over the range of values x G X. The 
value b that maximizes the expected utility in |(2)| becomes 
the optimal decision in this case. 

In this paper however the underlying distribution F is not 
completely known but only forecasts about the random quan¬ 
tity X are available. We consider probabilistic forecasts [6], 
[7], which provide information about the underlying prob¬ 
ability distribution F of x. More specifically we consider 
given (measurable) functions gi{x), for i = 1,... ,n, of the 
random quantity x, and forecasts stating that the expected 

















value of these functions is upper bounded by some parame¬ 
ters £i, i.e., 

E^.^pgi{x) < £i, i = (3) 

Both the functions gi{x) and the parameters Si in these 
inequalities are given to the decision-maker, e.g., provided by 
some forecasting algorithm. We interpret the forecasts |(3)| as 
a set of constraints on the unknown underlying distribution 
F, narrowing the uncertainty of the decision-maker regarding 
the distribution of x. Based on this interpretation, we will 
interchangeably use the terms forecasts and constraints when 
referring to |(3)| 

The constraint-based characterization of the uncertainty in 
1(3)1 matches many common types of forecasts. For example 
bounds on mean value, second moment, or higher-order 
moments can be expressed in the form of |(3)| by appropriately 
selecting the function gi{x). In the following example we 
detail another special case, the prediction intervals. We will 
revisit this case later. 


Example 2. An example motivated by recent forecast meth¬ 
ods for renewable generation [6] are prediction intervals. 
Suppose the renewable generation x takes values in a nor¬ 
malized interval X = [0,1], partitioned into m sub-intervals 
[xi-i,Xi) for i = 1,...,m, where xq = 0, Xm = 1, and the 
forecasts take the form 


bj < P(xi_i < X < Xi) < Si, i = 1,... ,m. (4) 


In other words, this forecast provides upper and lower bounds 
on the probability that x takes value in each of the intervals. 
These forecasts can be reformulated into the form of |(3)| by 
defining the indicator functions 


gi{x) = l{[xi-i,Xi)} 


1 if Xi-i < X < Xi, 

0 otherwise, 


( 5 ) 


for i = 1,... ,m. Then the constraints |(4)| are equivalently 
written in the form of |(3)| as 


]Ex~F3i(a;) < S^, and - g^{x) < -Si- (6) 


A pictorial representation is given in Fig. |2] 


We make the following technical assumption about the 
forecasts provided in |(3)| 

Assumption 1. There exists a probability distribution F 
on X such that the forecasts |(3)| are satisfied with strict 
inequality, i.e., Ea;,,.,^ giix) < £i for all i = 1,..., n. 

This assumption states that the constraints |(3)| are feasible, 
and furthermore that strict feasibility holds. To motivate the 
feasibility, we can think of the actual underlying distribution 
of the random quantity, which is not known to the decision¬ 
maker, as satisfying the constraints |(3)| The strict feasibility 
is assumed for certain technical reasons in order to establish 
the results in the sequel of this paper, but is not practically 
restrictive. The parameters £i can always be perturbed to 
make the strict inequality assumption hold. 

Given the constraint-based interpretation of the forecasts 
in|(3)[ we can alternatively describe the planner’s uncertainty 



Fig. 2. Pictorial representation of prediction intervals (Example |2). The 
true but unknown probability density function of the renewable generation 
is plotted. Forecasts of the generation are available in the form of prediction 
intervals, which are upper and lower bounds on the real probability mass 
placed on m = 6 equal intervals of generation amounts. For illustration 
purposes the prediction intervals are shown by blocks, with the areas under 
the blocks equal to the upper and lower bounds, 5i and respectively, 
according to|(4)| 


concerning the true distribution F of the unknown quantity 
X by a set F containing the (possibly uncountably many) 
distributions that satisfy the forecasts |(3)| The set of all 
distributions consistent with forecasts |(3)| is 

F{£) = {F &Vx : E3„..._f gi{x) < £i, i = l,...,n}, (7) 

where we denote the set of all probability distributions on X 
by Vx- Here we parametrize the set with the forecast bounds 
£i, i = l,...,n, grouped in a vector e. For the current 
problem development, £ is fixed, but we introduce this 
parametrization for later use in Section IIV] By Assumption [T] 
the set F{£) has a nontrivial interior, e.g., the true underlying 
distribution belongs in the set. 

Having defined the utility J{x,b) of different choices b 
to the decision-maker, as well as the possible distributions 
F € F{£) of the random quantity x that the decision-maker 
can anticipate according to the probabilistic forecasts, we 
now pose a planning problem. The decision-maker needs to 
determine the value b that robustly maximizes the worst-case 
expected utility anticipated from the forecasts, mathemati¬ 
cally captured as a robust optimization problem 

P*(e) = maximize inf Jix,b). (8) 

Yn other words, the planner looks for a decision that leads 
to the most favorable objective value assuming the worst- 
case distribution among the ones consistent with the forecast 
P(e). We denote this robustly optimal objective value by 
P*(e), again parametrized by the forecast bounds £ for later 
use. We also denote the optimal planning decision as &*(e), 
assuming it exists. 

The difficulty in solving the robust planning problem |(8)| 
lies in the max-min formulation. For every choice b, one 
needs to solve a minimization problem with respect to the 
distribution F to evaluate how good the choice is, and then 
infer how to improve on b to maximize the objective function. 
To overcome this complexity, in the following section, we 













examine how the robust planning problem |( 8 )| can be equiv¬ 
alently written as a single-layer optimization problem. For 
the special case of forecasts given by prediction intervals 
(Example |2]), we obtain an equivalent convex optimization 
problem from which b* (e) can be determined efficiently. 

We proceed in Section |IV] to characterize how informative 
the n forecasts given by |(3)| are in determining the optimal 
planning decisions. To this end, we examine how sensitive 
the robust planning objective value P*{e) is to changes in 
the forecast parameters e. Based on this analysis, we also 
develop a methodology for improving the planning utility 
by appropriately refining forecasts. 


III. Robustly optimal planning 


The robust planning problem described in |( 8 )| involves 
a max-min structure that makes it hard to determine the 
robustly optimal decision b*{e). In this section we follow 
an alternative path based on Lagrange duality theory [10]. 
Under certain technical conditions the inner minimization 
in 1(8)1 over probability distributions F of the unknown 
quantity x is equivalent to its Lagrange dual (maximization) 
problem. Replacing then the inner minimization in |( 8 )| with 
the equivalent maximization yields a maximization problem 
(with a single optimization layer) over the planning decision 
variable b and additional (dual) variables. We state this result 
in the following theorem. Its proof relies on the results 
of [ 10 ]. 

Theorem 1. If Assumption\I\holds, the robust planning prob¬ 
lem defined in \(8)\ is equivalent to the following optimization 
problem 

n 

- ^ A* ETj - p (9) 

i^l 

n 

J(x, 6) + ^ Ai Pi(a;)-f 77 > 0, Vx G T", 

i=l 

( 10 ) 

b eB, A G K”, 77 G M. (11) 


maximize 

b,X,r] 

subject to 


Proof The proof relies on the strong duality results in [10]. 
Lirst consider, for any value of b, the inner minimization 
in |( 8 )| in which the optimization variable is a probability 
distribution F over the unit interval X. Lormally this opti¬ 
mization variable can be expressed as a signed measure p on 
the space X equipped with the standard Borel tr-algebra. To 
be a probability measure, jj, is required to be positive, denoted 
by p > 0 , meaning that p assigns a positive mass on any 
subset of X that belongs in the standard Borel cr-algebra, and 
also to have a total mass equal to 1, dp(a;) = 1. Under 
this change of variables from probability distributions F to 
positive measures p,, and by the form of the set F in |(7)| the 


inner minimization in |( 8 )| can be equivalently written as 

P( 6 ) = minimize / J{x,b)dp{x) (12) 

Jx 

subject to / gfix) dp{x) < Si, z = 1 ,..., n, 

J X 


/ dp{x) = 1. 
J X 


(13) 

(14) 


The robust optimization problem [( 8 )| is equivalent to P* (e) = 
supbee P{b). 

The problem |(12)||(14)] is a linear program over mea¬ 
sures [10]. It has n -f 1 constraints, stating that the (n -f 1)- 
dimensional vector J^[gi{x),..., gn{x),l\^dp{x) lies in a 
cone with a vertex at point (ei,..., e„, 1) G We now 

claim that if we perturb this vertex point to be anywhere in 
an open ball around the point (ei,..., 1 ), the resulting 

optimization problem of the form|(12)l|(14)|is feasible. 


Claim 1. Let Assumption [7] hold. Then we can find an open 
ball B C around the point (ei, 1) such that 

for any e' G B the set of signed measures 


Ix 9^{x)dp{x) = 


is non-empty. 


This proof of this claim is included in Appendix [Al and is 
a consequence of Assumption [T] 

We then derive the Lagrange dual problem of |(12)||(14)] 
by defining multipliers (dual variables) Ai > 0 for each one 
of the inequality constraints |(13)| i = 1,..., n, as well as 
a multiplier 77 for the equality constraint |(14)| Since every 
finite subset of the space X belongs in the standard Borel 
CT-algebra, and all Dirac probability measures 5{x) at points 
x £ X are candidates for positive measures p > 0 in problem 
1(12)1 it can be shown - see [10, p.ll] - that the Lagrange 
dual problem becomes 

n 

D(b) = maximize — XiSi — r? (16) 

X>0,r] ^ 

- 2=1 

n 

subject to J{x,b)-\- E \gii.x) -f 77 > 0 , 

i=l 

Vx G A’. (17) 

Then, [10, Prop. 3.4] shows that under the result of Prop. [T] 
the optimal values of the minimization in |( 12 )| and the 
maximization in |(16)| are equal, i.e., P{b) = Dib). This 
holds for any variable b € B. Hence, returning to the original 
problem |( 8 )| and recalling that it can be written as P*{e) = 
supbgg P{b), we also have that P*{e) = sup^gg D{b). In 
other words we can replace the minimization over distri¬ 
butions F with the maximization over dual variables A, 77 
in 1(16)1(17)] The resulting problem is a maximization over 
variables b, A, 77 and corresponds exactly to problem [(9)| □ 

According to the theorem, the optimal choice b*{e) of the 
robust planning problem[( 8 )|can be equivalently found by the 




















optimization problem [(^( 1 1)[ and the optimal values of the 
two problems are equal. The advantage of the representation 
in |(9)||(TT)I is that it bypasses the max-min structure of [(8) 
There is a finite number of optimization variables in |(9)||(TT) 
i.e., the decision b, as well as the dual variables 77 G M and 
the vector A G R". The caveat, although, is that the number 
of constraints is infinite and uncountable. Such optimization 
problems are called semi-infinite - see [16] for an overview. 
Nevertheless it is a convex optimization problem since the 
objective is linear and the constraint |( 10) | for each value of x 
defines a convex set due to the concavity of function J{x, b) 
in variable b for any value x. 

General approaches for solving semi-infinite programs can 
be found in [16]. We examine however next the special 
case in which the forecasts |(3)| are given in the form of 
prediction intervals, described in Example |2] In that case 
it turns out that the number of constraints in the equivalent 
problem |(9)||(11)| can be reduced to a finite number. Hence 
we can pose the robust planning decision under prediction 
interval forecasts as a standard finite-dimensional convex 
optimization problem, for which efficient algorithms exist. 

After this special case, we show in the following section 
that the optimal values of the additional variables A in 
the optimization problem |(9)||(11)| can be interpreted as 
indicators of how valuable are the given forecasts in deter¬ 
mining the optimal planning. Based on this interpretation, 
we also examine in the following section how updates on 
the forecasts can yield a higher planning objective value. 

A. Robust planning under prediction intervals 

Consider forecasts given in the form of prediction intervals 
described in |(4)| By their reformulation into the generic form 
of forecasts presented in |( 6 )| we can pose the robust plan¬ 
ning decision problem |( 8 )| into its equivalent form provided 
by Theorem [T] in |(9)|(11)| In particular, introducing dual 
variables Ai,..., Am for the upper bound constraints in |( 6 )| 
and A]^,..., A^ similarly for the lower bounds, we have that 
robust planning follows from the solution to the problem 

m 

-1 ( 18 ) 

m 

J{x,b) +^(Ai - AJ 

i=l 

-I -77 > 0, for all X G [0,1]. (19) 

We note that the continuum of constraints |(19)| can be 
equivalently separated into the given intervals 

J(x, 6 )-I-(Ai - Aj)-f ?7 > 0, for all X G [xi_i,Xi), (20) 

for all i = 1,... ,m. Note also that by the assumption that 
the function J(x, b) is concave in x for all b, it follows that 
J{x,b) > min{ J(xi_i, &), J(xi, 6 )}, where the inequality 
is tight by continuity of function J{x,b). Hence instead of 
checking |(20)| for a continuum of values x G [xi_i,Xi) 
for each i = 1,... ,m, we only need to check at the two 


endpoints Xi_i and Xi. In other words, the problem |( 1 8)| - 
1(19)1 can be equivalently written as 

m 

maximize > (A,- S, — XiSA — ri (21) 

bGB, \>o, n ^ ' 

i=l 

subject to J{xi-i,b) -b Ai — A^ -f 77 > 0 , 

J (xi, (>) -b Ai — Aj -b 77 > 0 , 

for alH = 1 ,..., 771. ( 22 ) 

This optimization problem is convex and has a finite number 
of constraints. The number of constraints is twice the number 
of initial forecast intervals because essentially each forecast 
of the robust optimization is converted into two constraints, 
one for each of the two interval endpoints. The solution 
of 1 ( 21)1 can be determined efficiently by standard convex 
optimization algorithms. 

IV. Relative value oe eorecasts 

In this section we aim to quantify the value of the given 
forecasts |(3)| to the planner who needs to solve the robust 
planning problem |( 8 )| In particular, |(3)| defines a set of n 
given forecasts and we are interested in determining how 
valuable information does each one of them provide to the 
planner. To this end, we examine how sensitive the robustly 
optimal objective 'P*{e), defined in |( 8 )| is to the given 
forecast values e, i.e., what would the objective value become 
for deviations from the given parameters e. 

The forecast parameters e determine the objective P*{e) 
defined in |( 8 )| via the set of distributions T{£) appearing 
in the constraint of the inner minimization. To examine the 
sensitivity of P*{£) when the given forecast parameters e 
change to some new values e — Ae for some Ae G R", 
with the negative sign chosen for convention, we leverage the 
problem reformulation provided by Theorem [T] We exploit 
the well-known convex optimization fact that in general the 
optimal values of the Lagrange dual variables A express the 
sensitivity of the objective of a convex optimization problem 
with respect to changes in the constraints - see [11, Ch. 5.6]. 
In our case, the robust planning |( 8 )| involves a two-layer 
(max-min) optimization objective, but a similar sensitivity 
result can be obtained. We state this result in the following 
theorem. 

Theorem 2. Under Assumption [7] for any Ae G R" it holds 
that 

n 

P* (e - Ae) > P* (e) + ^ A* (e) Ae„ (23) 

where A*(e), for i = 1,...,7t,, is the optimal solution to 
problem |(9)| 

Proof. Along with A* (e) and already defined b* (e), let 77 * (e) 
be a corresponding optimal solution to problem |(9)|(11)| 
Under Assumption [T] by Theorem [T] i.e., the equivalence 
of |( 8 )| and 1(9)11(11)1 we have for the optimal objective that 

n 

P*(e) = -^A:(e)e,-77*(e), 


maximize 
bee. a>o,77 

subject to 


(24) 































as well as 

n 

J{x,b*{e)) + A* (e) gt {x) +g*{e)>0, (25) 

i=l 

for edl X € X by the feasibility to constraints |(10)| 

Fix any Ae € R" and consider P*{e — Ae) defined by 
1(8)1 By definition we have that 

P*(e —Ae)=sup inf Exr^FJ(x,b) 
bee FGFie-Ae) 

> inf Exr^F J(x,b*(e)), ( 26 ) 

where the last inequality follows because b* (e) is in general 
a suboptimal choice for b € B. Following arguments similar 
to those in the proof of Theorem [T] we can show that the 
optimal value of the minimization on the right hand side of 
1(26)1 is lower bounded by the optimal value of its Lagrange 
dual maximization problerrQ- Replacing this lower bound in 
1(26)1 we have that 

n 

P*(e — Ae) > maximize — Xi (e^ — Aej) — g (27) 

~ i—1 

n 

subject to J{x, b*{e)) + ^ Xi gi{x) 

i=l 

+ p > 0, for all X & X. 

(28) 

Note now that A* (e) and g* (e) are feasible solutions for the 
last optimization problem as follows by |(25)| hence we have 
that 

n 

P*{e-Ae) > - '^X*{e) {si - Asi) -g*{e) 

n 

= P*(P(e))+^A:(e)Ae. (29) 

i=l 

where the last equality follows by |(24)| Consequently the 
statement of the theorem holds. □ 

The theorem provides a lower bound on the optimal 
planning objective value of |(8)| after changes in the forecast 
bounds e provided in |(3)| The change in the lower bound 
compared to P*(e) is proportional to the change from e to 
e—Ae. The rate of change in planning objective units in each 
direction i is given by the optimal value of the (dual) variable 
A*(e) for each element i = 1,..., n. We can thus interpret 
the values A* (e) as indicators of how valuable each one of 
the forecasts |(3)| is to the planner. A forecast i with a high 
value A* (e) is more informative compared to other forecasts 
j ^ i because a small change in the given forecast bound 
Si gives a significant change in the (lower bound of the) 
objective. We also note that the values A*(e) have already 
been obtained during the computation of the robust planning 
by |(9)| hence no further computation is required to determine 
them. 

*In fact in Theorem[T]we show that under Assumption[T]the two optimal 
values are equal for Ae = 0, but here for Ae 7 ^ 0 the equality might not 
hold. 


Remark 1. The sensitivity analysis provided by Theorem |2] 
is performed with respect to the given forecast parameters e, 
i.e., they are obtained locally for e. At some other forecast 
parameter e', the sensitivities, i.e., the values A*(e') will 
differ. This difference matches the intuition that different 
forecasts provide different information. On the other hand 
the theorem provides a lower bound on the objective value 
at any deviation e — Ae € R", not just locally around e. It 
is also worth noting that the theorem does not provide any 
guarantee on how far the lower bound is with respect to the 
actual objective for such deviations. 

Given the sensitivity-based characterization of how infor¬ 
mative some given forecasts are, we next propose a method 
for refining/updating the forecasts in a way that increases the 
utility of the decision-making problem. 

A. Forecast refinements 

Suppose the decision-maker has access to a forecasting 
oracle, which upon request can refine one of the n given 
forecasts in |(3)| - see Remark |2] Inspecting the particular 
type of forecasts we consider in |(3)| written as inequalities, 
a refined forecast of some inequality i can be represented 
by decreasing the bound to a lower value £i — Aei. 
Equivalently, such decrease has the effect of shrinking the set 
J^(e) of possible probability distributions that the decision¬ 
maker has to consider, as defined in |(7)| to a subset P{e — 
As) C T{£). 

A question that arises in this context is how the decision¬ 
maker should select which of the n forecasts of |(8)| to 
refine. This question is particularly important if the cost of 
refining forecasts is high, e.g., the computational cost of 
running extra simulations of a forecasting algorithm. The 
characterization of Theorem |2] suggests that the forecast that 
provides the most valuable information should be selected, 
i.e., the forecast i with the highest value A* (e). Based on this 
intuition we propose the iterative forecast refinement proce¬ 
dure shown in Algorithm [T] On every iteration sensitivity 
values are obtained by solving the planning problem, and a 
refinement for the forecast bound with the highest sensitivity 
is requested (e.g. from the oracle) while all other bounds 
are kept constant. The procedure terminates, for example, if 
no further refinements are possible, or if the objective value 
improvements become insignificant. 

Even though this procedure is motivated mainly by in¬ 
tuition, without theoretical claims in terms of optimality, 
convergence, etc., we note the following fact. On each 
iteration, according to |(23)| of Theorem |2] the robustly 
optimal utility value increases by at least an amount equal 
to A*(e(fc)) {£j{k -I- 1) — £j{k)). This minimum increase is 
proportional to the amount of change in the selected forecast 
parameter. If the selected forecast j cannot be refined, i.e., 
ej{k + 1) = £j{k), then no improvement in decision-making 
is achieved. In general, however, the result matches the 
intuition that with a better forecast, i.e., imposing a more 
constraining set P{£ — Ae) in the minimization of |(8)| the 
objective value of the planning can only improve, i.e., the 
optimal value of |(8)| cannot decrease. 














Algorithm 1 Sensitivity-driven forecast refinement 
Input: Utility function J{x,b), forecast functions gi{x) for 
i = 1,... ,n, initial forecast parameters e(0) € K" 
Output: Robustly optimal planning decision b 
1: fc ^ 0 
2: repeat 

3: Solve robust planning |(9)||( 1 l)| vv'ith forecast parame¬ 

ters e(fc), and determine optimal planning b*{e{k)) and 
sensitivity values A*(e(fc)) G M" 

4: Select j = argmaxi<i<„ A*(e(fc)) 

5: Request refinement in jth forecast ej{k + l) < ej (k) 

6: Keep £i{k -f 1) = £i{k) for all i ^ j 

7: fc t- fc -I- 1 

8: until Termination condition 

9: return Final planning decision 6*(e(fc)) 


In the following section we present a numerical example 
for a wind power plant participating in an electricity market 
(Example [T]i under prediction interval forecasts (Example |2]). 
We demonstrate the methodology derived by Theorem [T] for 
determining the robustly optimal bidding decision. Addition¬ 
ally we perform the sensitivity analysis developed in this 
section and we implement the sensitivity-driven procedure 
for refining forecasts. 

Remark 2. The methodology for refining forecasts in this 
section is developed without considering a specific forecast¬ 
ing procedure. If, for example, an oracle obtains the forecasts 
by sampling from the true distribution of the unknown quan¬ 
tity, through simulations of an underlying stochastic model, 
then forecasts can be refined by drawing extra samples. 
However, a large number of extra samples might be required 
to improve upon the estimates of all the integrals in |(3)| 
e.g., in a Monte Carlo fashion. By focusing on only one of 
the n integrals during refinement, as the oracle does in our 
hypothesis, the number of required samples can be reduced. 
This can be performed by variance reduction techniques, 
such as importance sampling [17], which aim at sampling 
more often from values important in estimating the selected 
integral. 

V. Application: Bidding in electricity markets 

UNDER PREDICTION INTERVALS 

In this section we apply our theoretical results in a 
numerical example of bidding renewable energy in electricity 
markets. This problem was introduced in Example [T] with 
the utility function representing the profit for the renewable 
generator and repeated here for convenience, 

J{x,b) = pb — q[b — x]+. (30) 

The utility involves a reward term proportional to the bid 
b with a rate p > 0, and a penalty proportional to the 
generation shortfall [6 — x]+ with a rate q > p. Here 
both X € [0,1] and b G [0,1]. Suppose also the generator 
obtains forecasts about the future generation value x in the 
form of prediction intervals, introduced in Example |2] We 



Fig. 3. Expected profit and worst-case expected profit as a function of 
the chosen bidding for the case considered in the numerical example. The 
worst-case value is always an under-approximation of the tme one. The 
robustly optimal bidding balances between the shortfall penalties and gains 
from a high bid. 

consider the problem of determining the value of bidding 
that robustly maximizes the expected profit to the generator 
subject to the given forecasts about the renewable generation. 
This problem is mathematically described by the robust 
planning formulation |(8)| To solve this problem we adopt 
the reformulation into a convex optimization problem which 
was presented in Section IIII-AI and in particular takes the 

form [(2T)M 

In our numerical example, we consider forecasts for the 
probability that the random generation x takes values in each 
of m equal intervals that partition the total range of values 
[0,1], i.e., Xi = ijm for i = 0,... ,m - see also Example |2] 
To derive the lower and upper bounds, and 6i respec¬ 
tively according to |(4)1 we adopt the following procedure. 
We fix some true underlying probability distribution of the 
generation x and we produce bounds , 6i by perturbing 
the true value P(a:i_i < x < Xi) computed with the 
underlying distribution. Specifically we consider perturbation 
by a constant amount below and above P(xi_i < x < Xi) 
for each i = 1,... ,m. Then only the resulting lower and 
upper bounds, and Si respectively, are available to the 
generator, not the underlying probability distribution used to 
create them. 

Given m = 6 prediction intervals, shown in Pig. |2| 
we first solve the robust bidding problem, i.e., the con¬ 
vex optimization |(21)|(22)'| for reward and penalty values 
p = l,g = 1.6 in the profit function |(30)| We obtain an 
optimal bidding value b* roughly equal to 0.67. In Pig. [3 
we illustrate the value of the worst-case expected profit, 
i.e., the objective in the planning problem |(8)[ for all bid 
values b as well as the optimal point. Por comparison we also 
illustrate the expected profit computed using the underlying 
true probability distribution of x for all bid values b. The 
worst-case expected profit is an under-approximation of the 
true expected value, since the forecasts provide incomplete 
information about the underlying distribution. The optimal 
choice b* balances between the two extremes 0 and 1, as 
expected by intuition, with 6 = 0 being the most ’’secure” 
bid but yielding a zero expected profit, and 6=1 being the 
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Ai 

A2 

A3 

A4 

A5 

Ae 

Iter 1 

0.66 

0.39 

0.12 

0.00 

0.00 

0.00 

Iter 2 

0.80 

0.53 

0.27 

0.00 

0.00 

0.00 

Iter 3 

0.80 

0.53 

0.27 

0.00 

0.00 

0.00 

Iter 4 

1.07 

0.80 

0.53 

0.27 

0.00 

0.00 


TABLE I 

Sensitivities to upper forecast bounds 
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Iter I 

0.00 

0.00 

0.00 

0.14 

0.41 

0.41 

Iter 2 

0.00 

0.00 

0.00 

0.00 

0.27 

0.27 

Iter 3 

0.00 

0.00 

0.00 

0.00 

0.27 

0.27 

Iter 4 

0.00 

0.00 

0.00 

0.00 

0.00 

0.00 


TABLE II 

Sensitivities to lower forecast bounds 
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Fig. 4. Normalized worst-case profit as a function of bid value after itera¬ 
tions of the forecast refinement algorithm. The plotted values are normalized 
by subtracting the true underlying expected profit. The refinements increase 
the worst-case profit and bring it closer to the true value. The robustly 
optimal biding values are also indicated. 


most ’’risky” bid since it is certain that a shortfall x < b = 1 
will occur. 

Moreover, we examine how the information provided by 
the given forecasts of Fig.|2]is valued by the decision maker. 
Following the development of Section |IV] we examine the 
sensitivity of the robust bidding profit to the set of given 
forecasts. According to Theorem|2]this sensitivity is captured 
by the optimal values of the dual variables. In particular the 
sensitivity to upper and lower bounds of prediction intervals, 
6i and 6^ respectively, for i = 1 ,..., m, is captured by the 
dual variables Ai,..., Am and A^,..., A„ respectively, as 
introduced in Section IIII-AI 

The sensitivity values in our example, as computed by 
solving 1 ( 21 ) 11 ( 22)1 are shown in the first row of each of 
the Tables III El First we observe that at each interval 
i = 1 ,..., m, at most one of the upper or lower bound 
sensitivities Ai, Aj is non-zero. The reason is that at most one 
of two bounds (cf. |(4)| ) is tight for the worst-case distribution 
at the robustly optimal point of problem |( 8 )| Second, from 
the dual values we see that the optimal profit is sensitive to 
the probability of both having a low generation (captured by 
^ 1 )^ 2 , A 3 ) as well as having a high generation (captured by 
^^jAgjAg). In other words information about these events 
is the most informative. However the sensitivity is higher 
in the low generation case because the profit is significantly 
affected by shortfalls in this case. 

Next we adopt the sensitivity-driven methodology devel¬ 
oped in Section |IVl that uses a forecasting oracle to refine 
the given forecasts so that they become more informative for 
the robust bidding problem. Following Algorithm [T1 we itera¬ 
tively select to refine the forecast with the highest sensitivity 
value, i.e., one of the upper or lower bounds 6i and 6^ for 
an interval i in |(4)[ If an upper bound 6i is selected it gets 
decreased, and respectively if a lower bound is selected 
it gets increased. In our numerical example, the resulting 
new bound needs to be consistent with the true underlying 
distribution, so we model the following forecasting oracle. 
Upon request, the oracle can refine each upper or lower 
bound by a constant decrease or increase respectively, and 


the new bound is close enough to the true value so that 
the oracle cannot refine it any further. This model is chosen 
here for simplicity. In general. Algorithm [T1 can be applied 
regardless of how the forecasting oracle operates. 

We perform iterations of Algorithm [T1 and the corre¬ 
sponding sensitivity values on each iteration are shown 
in Tables III HU As already mentioned, Ai has the higher 
value initially, so the upper bound on the first interval is 
refined (i.e., reduced). On the second iteration, after solving 
again the optimal planning |( 21 )||( 22 )I the new sensitivity 
values are obtained and again Ai is the largest. Since the 
forecasting oracle described in our example cannot reduce 
the corresponding bound any further, the bound with the 
second largest sensitivity is selected to be refined, which is 
A 2 . On the third iteration, again Ai, Ai are the largest, but due 
to the oracle model, the third largest A 3 is selected, and so on. 
It is worth noting that after some iterations we observe that 
the sensitivities of all lower bounds Aj become zero. This 
means that currently they are not providing any valuable 
information, and refining (i.e., increasing) them does not 
necessarily offer an improvement in profit. 

Finally, to examine the improvements in the bidding profit 
after these iterations of the refinement algorithm, we plot in 
Fig.lHthe worst-case expected profit, i.e., the objective in the 
planning problem |( 8 )| for all bid values b. For visualization 
reasons the values are normalized by subtracting the true 
expected profit (the latter is larger, as shown in Fig. 0 
so the plotted values are negative). We observe that as the 
forecasts are refined, the worst-case expected profit increases 
and gets closer to the actual expected profit (the baseline 
zero). In other words, the refined forecasts provide a more 
accurate model of the true profit. It is worth noting that the 
largest discrepancy between worst-case and true expected 
value happens at the high bid region, since the worst-case 
scenario assumes that generation takes the lowest possible 
values with the highest possible probability, making the 
associated shortfall costs large. We also note that in this 
example even though the optimal value of the profit increases 
with the forecast refinements, the optimal bidding value 









































shown in Fig. |4] is the same. This is a consequence of the 
piecewise linear utility function in |(30)| For more general 
utility functions the optimal planning decision can change as 
well during the refinements, along with the optimal objective 
values, and our algorithm is able to track these changes. 

VI. Concluding remarks and future work 


measure fi in 


the set 1(15)1 i.e., satisfying 

g^{x) dfi{x) < e'i, i = l,.. 


/ rfM(a;)=4+i- 


n, 


(31) 

(32) 


J X 

First, by Assumption [T] we have that there exists a proba¬ 
bility measure p, > 0 such that 


We examine how planning problems under uncertainty can 
utilize forecasts about uncertain quantities. By equivalently 
expressing forecasts as constraints on the possible proba¬ 
bility distributions that the uncertain quantity can follow, 
the problem is cast as a robust optimization. The optimal 
planning seeks to maximize the expected value of a given 
utility function, integrated with respect to the worst-case 
distribution consistent with the forecasts. A reformulation 
into a convex optimization problem is presented, from which 
we can extract information about how valuable are the 
forecasts in determining the optimal planning decision. 

Under the hypothesis that a forecasting oracle can re¬ 
fine the given forecasts upon request, i.e., that the set of 
possible probability distributions in the robust planning can 
be decreased, an iterative forecast refinement procedure is 
proposed in Section IIV-AI Even though the procedure is 
justified theoretically, in practice the way refinements are 
performed depends on the employed forecasting algorithm. 
For example, it might be the case that many forecasts can 
be refined at the same time, instead of just one at a time 
as we consider, or that the requested forecast cannot be 
refined due to limitations of the oracle and/or because the 
forecast is very close to the true value. Moreover, besides just 
refining the set of given forecasts, as we consider, the oracle 
might generate new additional forecasts. Overall, coupling 
the proposed sensitivity-driven approach with forecasting 
algorithms used in practice requires further exploration. 

More general problem formulations include the study of 
uncertain quantities evolving over time in a potentially corre¬ 
lated fashion, e.g., the random renewable generation during 
a time horizon in the future. Forecasting for such models 
has been considered in, e.g., scenario-based forecasts [18]. 
Furthermore, apart from uncertain quantities spread over 
time, the framework can be extended to include spatially 
distributed models as well, as in the case of correlated 
renewable generations at different physical locations in the 
grid. 


Appendix 


A. Proof of Claim 1 

The proposition equivalently states that we can find i5 > 0 
(the radius of ball B) such that for any e' S ]R"+i that 
satisfies |e' — £i| < b, i = 1,..., n, and l4-ri-l| < 5, i.e., 
e' belongs in ball B, the set |(15)| is non-empty. The proof 
is constructive. We first construct an appropriate ball radius 
S, and then for any point e' G i? in the ball we construct a 


/ gt{x)dg{x) < £i-C, i = l,...,n, 
J X 

/ dp{x) = 1, 

J X 


for some > 0. Define the ball radius to be 

c c 


S = min 


1+ki “ Cl ’ ’ 1 + kn —cr 


1 . 


(33) 

(34) 


(35) 


Fix then any point e' G B. We need to show that 
there exists a signed measure g > 0 such that |(31)| and 
1(32)1 hold. Consider the measure p = e^+i p defined by 
scaling the measure p, i.e., g{S) = ^^^h 

set S in the Borel cr-algebra of the set X. Note that this 
measure is positive since e'^+i > 0, which follows from 
\E'^_f_i — 1| < b < 1. Also this measure satisfies |(32)| since 
dg{x) = e'^+x L = ^n+v 

Hence it remains to show that |(31)| holds. The right hand 
side of |(31)| satisfies e' > Ei - Also, the left hand side of 
1(31)1 satisfies 


/ g^{x) dg{x) = / gi{x) dg{x) < e'„^x “ C), 

"" "" (36) 

because of |(33) and > 0 as we argued previously. 

Hence to show (31)|it suffices to show that e'^^x (Ez “ C) ^ 


Ei — 5 for all* = 1,..., n. 

We consider two cases for the sign of quantity Ei — (. If 

Ei — C > Oz since e'^_^x < 1 + <), we have that — 

0 < (1 + S)(Ei — (). In that case to show |(313| it suffices 
to show that (1 -|- S)(Ei — () < Ei — S. This is equivalent to 
(5(1-1-£i — C) < (, easily checked by our choice of S in |(35)| 

Similarly for the case £i — ^ < 0, since E'^_^_x > 1 — b, we 

have that E'^j^xi^i ~ C) < (1 ~ <5)(Ei — C)- Thus for |(31)| it 
suffices to show that (1 — b)(£i — C) < Ei — S which can be 
again easily checked by our choice of S in |(35)| 
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